AITopics | abstract representation

Self-Normalizing Neural Networks

Neural Information Processing SystemsMar-17-2026, 14:36:30 GMT

Deep Learning has revolutionized vision via convolutional neural networks (CNNs) and natural language processing via recurrent neural networks (RNNs). However, success stories of Deep Learning with standard feed-forward neural networks (FNNs) are rare. FNNs that perform well are typically shallow and, therefore cannot exploit many levels of abstract representations. We introduce self-normalizing neural networks (SNNs) to enable high-level abstract representations. While batch normalization requires explicit normalization, neuron activations of SNNs automatically converge towards zero mean and unit variance. The activation function of SNNs are scaled exponential linear units (SELUs), which induce self-normalizing properties.

artificial intelligence, machine learning, normalization, (10 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

5ca41a86596a5ed567d15af0be224952-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 13:44:21 GMT

agent, exploration, representation, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Neural Information Processing SystemsFeb-8-2026, 13:44:10 GMT

abstract representation, exploration, forward intrinsic reward planning, (12 more...)

Neural Information Processing Systems

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

Compositional generalization through abstract representations in human and artificial neural networks

Neural Information Processing SystemsDec-25-2025, 08:29:20 GMT

Humans have a remarkable ability to rapidly generalize to new tasks that is difficult to reproduce in artificial learning systems.Compositionality has been proposed as a key mechanism supporting generalization in humans, but evidence of its neural implementation and impact on behavior is still scarce. Here we study the computational properties associated with compositional generalization in both humans and artificial neural networks (ANNs) on a highly compositional task. First, we identified behavioral signatures of compositional generalization in humans, along with their neural correlates using whole-cortex functional magnetic resonance imaging (fMRI) data. Next, we designed pretraining paradigms aided by a procedure we term primitives pretraining to endow compositional task elements into ANNs. We found that ANNs with this prior knowledge had greater correspondence with human behavior and neural compositional signatures. Importantly, primitives pretraining induced abstract internal representations, excellent zero-shot generalization, and sample-efficient learning. Moreover, it gave rise to a hierarchy of abstract representations that matched human fMRI data, where sensory rule abstractions emerged in early sensory areas, and motor rule abstractions emerged in later motor areas. Our findings give empirical support to the role of compositional generalization in humans behavior, implicate abstract representations as its neural implementation, and illustrate that these representations can be embedded into ANNs by designing simple and efficient pretraining procedures.

abstract representation, compositional generalization, generalization, (7 more...)

Neural Information Processing Systems

Industry:

Health & Medicine > Health Care Technology (0.82)
Health & Medicine > Diagnostic Medicine > Imaging (0.59)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.65)

Add feedback

Self-Normalizing Neural Networks

Neural Information Processing SystemsNov-21-2025, 15:21:24 GMT

Deep Learning has revolutionized vision via convolutional neural networks (CNNs) and natural language processing via recurrent neural networks (RNNs). However, success stories of Deep Learning with standard feed-forward neural networks (FNNs) are rare. FNNs that perform well are typically shallow and, therefore cannot exploit many levels of abstract representations. We introduce self-normalizing neural networks (SNNs) to enable high-level abstract representations. While batch normalization requires explicit normalization, neuron activations of SNNs automatically converge towards zero mean and unit variance. The activation function of SNNs are scaled exponential linear units (SELUs), which induce self-normalizing properties.

name change, normalization, self-normalizing neural network, (7 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A mathematical theory for understanding when abstract representations emerge in neural networks

Wang, Bin, Johnston, W. Jeffrey, Fusi, Stefano

arXiv.org Machine LearningOct-15-2025

Recent experiments reveal that task-relevant variables are often encoded in approximately orthogonal subspaces of the neural activity space. These disentangled low-dimensional representations are observed in multiple brain areas and across different species, and are typically the result of a process of abstraction that supports simple forms of out-of-distribution generalization. The mechanisms by which such geometries emerge remain poorly understood, and the mechanisms that have been investigated are typically unsupervised (e.g., based on variational auto-encoders). Here, we show mathematically that abstract representations of latent variables are guaranteed to appear in the last hidden layer of feedforward nonlinear networks when they are trained on tasks that depend directly on these latent variables. These abstract representations reflect the structure of the desired outputs or the semantics of the input stimuli. To investigate the neural representations that emerge in these networks, we develop an analytical framework that maps the optimization over the network weights into a mean-field problem over the distribution of neural preactivations. Applying this framework to a finite-width ReLU network, we find that its hidden layer exhibits an abstract representation at all global minima of the task objective. We further extend these analyses to two broad families of activation functions and deep feedforward architectures, demonstrating that abstract representations naturally arise in all these scenarios. Together, these results provide an explanation for the widely observed abstract representations in both the brain and artificial neural networks, as well as a mathematically tractable toolkit for understanding the emergence of different kinds of representations in task-optimized, feature-learning network models.

artificial intelligence, machine learning, representation, (19 more...)

arXiv.org Machine Learning

2510.09816

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.04)
North America > United States > New York > New York County > New York City (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Neurology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

5ca41a86596a5ed567d15af0be224952-Paper.pdf

Neural Information Processing SystemsOct-3-2025, 00:36:56 GMT

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Country: North America > Canada (0.28)

Genre: Research Report (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Neural Information Processing SystemsOct-3-2025, 00:36:45 GMT

abstract representation, artificial intelligence, exploration, (13 more...)

Neural Information Processing Systems

Genre: Research Report (0.52)

Technology: Information Technology > Artificial Intelligence (0.38)

Add feedback

d0241a0fb1fc9be477bdfde5e0da276a-Paper-Conference.pdf

Neural Information Processing SystemsAug-19-2025, 02:35:02 GMT

Humans can efficiently transfer prior knowledge to novel contexts, an ability commonly referred to as transfer learning.

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country:

North America > United States > Nebraska > Lancaster County > Lincoln (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.69)

Industry:

Health & Medicine > Therapeutic Area > Neurology (1.00)
Health & Medicine > Health Care Technology (0.70)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

Learning Abstract World Models with a Group-Structured Latent Space

Delliaux, Thomas, Vu, Nguyen-Khanh, François-Lavet, Vincent, van der Pol, Elise, Rachelson, Emmanuel

arXiv.org Artificial IntelligenceJun-3-2025

Learning meaningful abstract models of Markov Decision Processes (MDPs) is crucial for improving generalization from limited data. In this work, we show how geometric priors can be imposed on the low-dimensional representation manifold of a learned transition model. We incorporate known symmetric structures via appropriate choices of the latent space and the associated group actions, which encode prior knowledge about invariances in the environment. In addition, our framework allows the embedding of additional unstructured information alongside these symmetries. We show experimentally that this leads to better predictions of the latent transition model than fully unstructured approaches, as well as better learning on downstream RL tasks, in environments with rotational and translational features, including in first-person views of 3D environments. Additionally, our experiments show that this leads to simpler and more disentangled representations. The full code is available on GitHub to ensure reproducibility.

machine learning, reinforcement learning, transition, (16 more...)

arXiv.org Artificial Intelligence

2506.01529

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Filters

Collaborating Authors

abstract representation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

Self-Normalizing Neural Networks

5ca41a86596a5ed567d15af0be224952-Paper.pdf

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

Compositional generalization through abstract representations in human and artificial neural networks

Self-Normalizing Neural Networks

A mathematical theory for understanding when abstract representations emerge in neural networks

5ca41a86596a5ed567d15af0be224952-Paper.pdf

5ca41a86596a5ed567d15af0be224952-AuthorFeedback.pdf

d0241a0fb1fc9be477bdfde5e0da276a-Paper-Conference.pdf

Learning Abstract World Models with a Group-Structured Latent Space